Language Technologies in Humanities: Computational Semantic Analysis in Folkloristics

نویسندگان

  • Gregor Strle
  • Matija Marolt
چکیده

The paper discusses computational methods for natural language processing (NLP) and possibilities they offer to folkloristics. As folkloristic materials are very challenging for NLP, due to their specific semanticsyntactic structure, inherent dialectical diversity and strong intertextuality, a robust NLP method is needed that can account for topical distribution, detect general heterogeneity, and context. The focus of this paper is on computational semantic analysis (such as word-sense disambiguation, topic recognition) and its ability to uncover latent semantic structure of folkloristic corpora.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Software Projects for Developing Digital Humanities Resources

In this short paper we report on experiences gained from bachelor and master theses, and from a series of software projects conducted in cooperation with the Department of Computational Linguistics of the Saarland University. Those bachelor/master theses and software projects were dealing with the application of Natural Language Processing and Semantic Web technologies to the representation and...

متن کامل

Preferred Lexical Access Route in Persian Learners of English: Associative, Semantic or Both

Background: Words in the Mental Lexicon (ML) construct semantic field through associative and/ or semantic connections, with a pervasive native speaker preference for the former. Non-native preferences, however, demand further inquiry. Previous studies have revealed inconsistent Lexical Access (LA) patterns due to the limitations in the methodology and response categorization. Objectives: To f...

متن کامل

Data repositories in the Humanities and the Semantic Web: modelling, linking, visualising

The paper discusses the inherent potential of the Semantic Web and its related technologies for humanities research. The focal point lies on the extraction of semantic relations from heterogeneous XML based scholarly corpora using a webservice based infrastructure (XTriples). Especially the creation of methodologically distinct semantic corpora stemming from data sets originating in the humanit...

متن کامل

English and Persian Sport Newspaper Headlines: A comparative study of linguistic means

Abstract Using rhetorical figures in specialized languages like the language of newspaper headlines is common. The present study attempted to conduct a contrastive analysis of the English and Persian sport newspaper headlines related to the 2014 FIFA World Cup. Toward this end, a corpus consisting of 400 English and 400 Persian headlines published during 12th of June to 13th of July, 2014 was c...

متن کامل

English and Persian Sport Newspaper Headlines: A comparative study of linguistic means

Abstract Using rhetorical figures in specialized languages like the language of newspaper headlines is common. The present study attempted to conduct a contrastive analysis of the English and Persian sport newspaper headlines related to the 2014 FIFA World Cup. Toward this end, a corpus consisting of 400 English and 400 Persian headlines published during 12th of June to 13th of July, 2014 was c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016